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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/807 , 933B 



DATE: 01/22/2003 
TIME: 13:30:49 
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Input Set : A:\09807933SeqList.txt 
Output Set: N:\CRF4\01222003\l807933B.raw 

<110> APPLICANT: MEIJI SEIKA KAISHA, LTD. 

<120> TITLE OF INVENTION: Endoglucanase and cellulase composition containing the 
same 

<130> FILE REFERENCE: 121659PX 

<140> CURRENT APPLICATION NUMBER: US/09/807 , 933B 
<141> CURRENT FILING DATE: 2002-04-18 

<150> PRIOR APPLICATION NUMBER: JP302387 / 1998 
<151> PRIOR FILING DATE: 199^-^-23 
<160> NUMBER OF SEQ ID NOS */ 113, 
<170> SOFTWARE: Patent In V&s^>2. 0 
<210> SEQ ID NO: 1 
<211> LENGTH: 338 
<212> TYPE: PRT 

<213> ORGANISM: Rhizopus oryzae CP96001 
<220> FEATURE: 



sig_peptide 
(-23) . . (-1) 
mat_peptide 
(1) . . (315) 



<221> NAME /KEY 
<222> LOCATION 
<221> NAME/KEY 
<222> LOCATION 
<400> 1 

Met Lys Phe lie Thr lie Ala Ser Ser Ala Leu Leu Ala Leu Ala Leu 

-20 -15 -10 

Gly Thr Glu Met Ala Ser Ala Ala Glu Cys Ser Lys Leu Tyr Gly Gin 

-5 15 
Cys Gly Gly Lys Asn Trp Asn Gly Pro Thr Cys Cys Glu Ser Gly Ser 
10 15 20 25 

Thr Cys Lys Val Ser Asn Asp Tyr Tyr Ser Gin Cys Leu Pro Ser Gly 

30 35 40 

Ser Ser Gly Asn Lys Ser Ser Glu Ser Ala His Lys Lys Thr Thr Thr 

45 50 55 

Ala Ala His Lys Lys Thr Thr Thr Ala Ala His Lys Lys Thr Thr Thr 

60 65 70 

Ala Pro Ala Lys Lys Thr Thr Thr Val Ala Lys Ala Ser Thr Pro Ser 

75 80 85 

Asn Ser Ser Ser Ser Ser Ser Gly Lys Tyr Ser Ala Val Ser Gly Gly 
90 95 100 105 

Ala Ser Gly Asn Gly Val Thr Thr Arg Tyr Trp Asp Cys Cys Lys Ala 

110 115 120 

Ser Cys Ser Trp Pro Gly Lys Ala Asn Val Ser Ser Pro Val Lys Ser 

125 130 135 

Cys Asn Lys Asp Gly Val Thr Ala Leu Ser Asp Ser Asn Ala Gin Ser 

140 145 . 150 

Gly Cys Asn Gly Gly Asn Ser Tyr Met Cys Asn Asp Asn Gin Pro Trp 



ENTERED 
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RAW SEQUENCERS STING DATE : 1)1/22/2003 

PATENT APPLICATION: US/09/807 , 933B TIME: 13:30:49 

Input Set : A:\09807933SeqList.txt 
Output Set: N:\CRF4\01222003\l807933B.raw 

67 155 160 165 

69 Ala Val Asn Asp Asn Leu Ala Tyr Gly Phe Ala Ala Ala Ala lie Ser 

70 170 175 180 185 

72 Gly Gly Gly Glu Ser Arg Trp Cys Cys Ser Cys Phe Glu Leu Thr Phe 

73 190 195 200 

75 Thr Ser Thr Ser Val Ala Gly Lys Lys Met Val Val Gin Val Thr Asn 

76 205 210 215 

78 Thr Gly Gly Asp Leu Gly Ser Ser Thr Gly Ala His Phe Asp Leu Gin 

79 220 225 230 

81 Met Pro Gly Gly Gly Val Gly lie Phe Asn Gly Cys Ser Ser Gin Trp 

82 235 240 245 

84 Gly Ala Pro Asn Asp Gly Trp Gly Ser Arg Tyr Gly Gly lie Ser Ser 

85 250 255 260 265 

87 Ala Ser Asp Cys Ser Ser Leu Pro Ser Ala Leu Gin Ala Gly .Cys Lys 

88 270 275 280 

90 Trp Arg Phe Asn Trp Phe Lys Asn Ala Asp Asn Pro Ser Met Thr Tyr 

91 285 290 295 

93 Lys Glu Val Thr Cys Pro Lys Glu lie Thr Ala Lys Thr Gly Cys Ser 

94 300 305 310 
96 Arg Lys 

.97 315 

99 <210> SEQ ID NO: 2 

100 <211> LENGTH: 1017 

101 <212> TYPE: DNA 

102 <213> ORGANISM: Rhizopus oryzae CP96001 

104 <220> FEATURE: 

105 <221> NAME /KEY : sig_peptide 

106 <222> LOCATION: (1)..(69) 

107 <221> NAME /KEY : mat_peptide 

108 <222> LOCATION: (70).. (1017) 
W — > 110 <400> 2 

111 atg aag ttt att act att gcc tct tec get etc ttg get etc gee etc 48 

112 Met Lys Phe lie Thr lie Ala Ser Ser Ala Leu Leu Ala Leu Ala Leu 

113 -20 -15 -10 

115 ggt act gaa atg gcc tct get get gaa tgt age aaa ttg tat ggt caa 96 

116 Gly Thr Glu Met Ala Ser Ala Ala Glu Cys Ser Lys Leu Tyr Gly Gin 

117 -5 1 5 

119 tgt ggt ggt aag aac tgg aat ggc cct act tgt tgt gaa tct gga tec 144 

120 Cys Gly Gly Lys Asn Trp Asn Gly Pro Thr Cys Cys Glu Ser Gly Ser 

121 10 15 20 25 

123 acc tgt aaa gta age aac gat tac tac tct caa tgt ctt ccc tct gga 192 

124 Thr Cys Lys Val Ser Asn Asp Tyr Tyr Ser Gin Cys Leu Pro Ser Gly 

125 30 35 40 

127 age agt ggc aat aaa tct tct gaa agt get cac aag aag act acc act 240 

128 Ser Ser Gly Asn Lys Ser Ser Glu Ser Ala His Lys Lys Thr Thr Thr 

129 45 50 55 

131 get get cac aag aag act act acc get get cat aaa aag act acc act 288 

132 Ala Ala His Lys Lys Thr Thr Thr Ala Ala His Lys Lys Thr Thr Thr 

133 60 65 70 
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RAW SEQUENCE LISTING DATE: t) 1/2 2/2 003 

PATENT APPLICATION : ■ US/09/807 , 933B TIME: 13:30:49 

Input Set : A:\09807933SeqList.txt 
Output Set: N:\CRF4\01222003\l807933B.raw 

135 get cct get aag aag act aca act gtt gec aaa get tec ace cct tct 336 

136 Ala Pro Ala Lys Lys Thr Thr Thr Val Ala Lys Ala Ser Thr Pro Ser 

137 75 80 85 

139 aac tct age tct age tec age ggc aaa tat tec get gtc tct ggt ggt 384 

140 Asn Ser Ser Ser Ser Ser Ser Gly Lys Tyr Ser Ala Val Ser Gly Gly 

141 90 95 100 105 

143 gee tct ggt aac ggt gtc act act cgt tat tgg gat tgc tgt aag gee 432 

14 4 Ala Ser Gly Asn Gly Val Thr Thr Arg Tyr Trp Asp Cys Cys Lys Ala 

145 110 115 120 

147 tec tgt age tgg ccc ggt aag gee aat gtc agt tct cct gtc aag tec 480 

14 8 Ser Cys Ser Trp Pro Gly Lys Ala Asn Val Ser Ser Pro Val Lys Ser 
149 125 130 135 

151 tgt aac aaa gat ggt gtc act gec ctt agt gac age aat gee caa agt 528 

152 Cys Asn Lys Asp Gly Val Thr Ala Leu Ser Asp Ser Asn Ala Gin Ser 

153 140 145 150 

155 ggc tgt aac ggt ggt aac agt tac atg tgt aac gac aac caa cct tgg 576 

15 6 Gly Cys Asn Gly Gly Asn Ser Tyr Met Cys Asn Asp Asn Gin Pro Trp 
157 155 160 165 

159 get gta aac gac aac ctt gee tat ggt ttc get get get gec ate agt 624 

160 Ala Val Asn Asp Asn Leu Ala Tyr Gly Phe Ala Ala Ala Ala He Ser 

161 170 175 180 185 

163 ggt ggt ggt gaa tct cgc tgg tgc tgt tct tgt ttc gaa ctt act ttc 672 

164 Gly Gly Gly Glu Ser Arg Trp Cys Cys Ser Cys Phe Glu Leu Thr Phe 

165 190 195 200 

167 act tct acc tct gtt get ggt aag aag atg gtt gtc caa gtc act aac 720 

168 Thr Ser Thr Ser Val Ala Gly Lys Lys Met Val Val Gin Val Thr Asn 

169 205 210 215 

171 act ggt ggt gat ctt ggc tec tct act ggt get cac ttt gac ttg caa 768 

172 Thr Gly Gly Asp Leu Gly Ser Ser Thr Gly Ala His Phe Asp Leu Gin 

173 220 225 230 

175 atg ccc ggt ggt ggt gtt ggt att ttc aat ggt tgt tec age caa tgg 816 

176 Met Pro Gly Gly Gly Val Gly He Phe Asn Gly Cys Ser Ser Gin Trp 

177 235 240 245 

179 ggt get ccc aat gac ggt tgg ggc tea aga tac ggt ggt att tct tct 8 64 

180 Gly Ala Pro Asn Asp Gly Trp Gly Ser Arg Tyr Gly Gly He Ser Ser 

181 250 255 260 265 

183 gca tct gac tgc tct agt ctt cct tec gca etc caa get ggt tgt aaa 912 

184 Ala Ser Asp Cys Ser Ser Leu Pro Ser Ala Leu Gin Ala Gly Cys Lys 

185 270 275 280 

187 tgg aga ttc aac tgg ttc aag aac get gat aac cca age atg act tac 960 

188 Trp Arg Phe Asn Trp Phe Lys Asn Ala Asp Asn Pro Ser Met Thr Tyr 

189 285 290 295 

191 aag gaa gtt acc tgt cct aag gaa ate acc gec aag aca ggt tgt tea 1008 

192 Lys Glu Val Thr Cys Pro Lys Glu He Thr Ala Lys Thr Gly Cys Ser 

193 300 305 310 

195 aga aaa taa 1017 

196 Arg Lys 

197 315 

199 <210> SEQ ID NO: 3 
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RAW SEQUENCE^tSTING DATE rT)l /22/2003 

PATENT APPLICATION: US/09/807 , 933B TIME: 13:30:49 

Input Set : A:\09807933SeqList.txt 
Output Set: N:\CRF4\01222003\l807933B.raw 

200 <211> LENGTH: 366 

201 <212> TYPE: PRT 

202 <213> ORGANISM: Rhizopus oryzae CP96001 

204 <220> FEATURE: 

205 <221> NAME/KEY: sig_peptide 

206 <222> LOCATION: (-23) . . (-1) 

207 <221> NAME/KEY: mat^peptide 

208 <222> LOCATION: (1)..(343) 
W— > 210 <400> 3 

211 Met Lys Phe He Thr He Thr Ser Ser Ala Leu Leu Ala Leu Ala Leu 

212 -20 -15 -10 

214 Gly Thr Glu Met Ala Ser Ala Ala Lys Cys Ser Lys Leu Tyr Gly Gin 

215 -5 15 

217 Cys Gly Gly Lys Asp Trp Asn Gly Pro Thr Cys Cys Glu Ser Gly Ser 

218 10 15 20 25 

220 Thr Cys Lys Val Ser Asn Asp Tyr Tyr Ser Gin Cys Leu Ala Pro Glu 

221 30 35 40 

223 Ser Asn Gly Asn Lys Ser Ser Glu Cys Ser Lys Leu Tyr Gly Gin Cys 

224 45 50 55 

226 Gly Gly Lys Asp Trp Asn Gly Pro Thr Cys Cys Glu Ser Gly Ser Thr 

227 60 65 70 

229 Cys Lys Val Ser Asn Asp Tyr Tyr Ser Gin Cys Leu Ala Pro Glu Ser 

230 75 80 85 

232 Asn Gly Asn Lys Thr Ser Glu Ser Ala His Lys Thr Thr Thr Thr Thr 

233 90 95 100 105 

235 Ala Pro Ala Lys Glu He Thr Thr Thr Ala Lys Ala Ser Asn Ser Ser 

236 110 115 120 

238 Asn Ser Ser Gly Lys Tyr Ser He Val Ser Gly Gly Ala Ser Gly Asn 

239 125 130 135 

241 Gly Val Thr Thr Arg Tyr Trp Asp Cys Cys Lys Ala Ser Cys Ser Trp 

242 140 145 150 

244 Pro Gly Lys Ala Asn Val Ser Ser Pro Val Lys Ser Cys Asn Lys Asp 

245 155 160 165 

247 Gly Val Thr Ala Leu Ser Asp Ser Asn Val Gin Ser Gly Cys Asn Gly 

248 170 175 .180 185 

250 Gly Asn Ser Tyr Met Cys Asn Asp Asn Gin Pro Trp Ala Val Asn Asp 

251 190 195 200 

253 Asn Leu Ala Tyr Gly Phe Ala Ala Ala Ala He Ser Gly Gly Gly Glu 

254 205 210 215 

256 Ser Arg Trp Cys Cys Ser Cys Phe Glu Leu Thr Phe Thr Ser Thr Ser 

257 220 225 - 230 

259 Val Ala Gly Lys Lys Met Val He Gin Val Thr Asn Thr Gly Gly Asp 

260 235 240 245 

262 Leu Gly Ser Ser Thr Gly Ala His Phe Asp Leu Gin Met Pro Gly Gly 

263 250 255 260 265 

265 Gly Val Gly He Phe Asn Gly Cys Ser Lys Gin Trp Gly Ala Pro Asn 

266 270 275 280 

268 Asp Gly Trp Gly Ser Arg Tyr Gly Gly lie Ser Ser Ala Ser Asp Cys 

269 285 290 295 
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PATENT APPLICATION: US/09/807 , 933B 



DATE: 01/22/2003 
TIME: 13:30:49 



Input Set : A:\09807933SeqList.txt 
Output Set: N:\CRF4\01222003\l807933B.raw 



271 


Ser 


Ser 
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Pro 


Ser 


Ala 


Leu 
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Gly 


Cys 
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Trp 


Arg 


Phe 


Asn 
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300 
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310 
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Trp 


Phe 


Lys 


Asn 


Ala 


Asp Asn 


Pro 


Ser 


Met 


Thr 


Tyr 


Lys 


Glu 


Val 


Thr 
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315 
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Cys 


Pro 


Lys 


Glu 


lie 


Thr 


Ala 


Lys 


Thr 


Gly 


Cys 


Ser 


Arg 


Lys 
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<210> SEQ ID NO: 


4 


























281 

il U J. 


<211> LENGTH: 1101 
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y 


<212> TYPE: 


DNA 




























O J 


<213> ORGANISM: 


Rhizopus 


; oryzae 


CP96001 
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<220> FEATURE: 




























o u 


<221> NAME/KEY: 


a 

siy 


peptide 






















987 


<222> LOCATION: 


\ 1 ) - 


(69) 






















98ft 
zoo 


<221> NAME/KEY: 


ma l 


peptide 






















98 Q 


<222> LOCATION: 


{ /V) 


. (1101) 






















ri /L^7 J. 


<400> 4 
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/~% /"-> +- 
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etc 
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*W X^-' 


ctt 


48 


Z _? J 


Met 


Lys 


T~j U. _ 

Fne 


lie 


i nr 


lie 


i nr 


C es. >» 


C £J T~ 

oer 


Ma 


Leu 


Leu 


Ala 


Leu 


Ala 


Leu 




Z fl 






o n 
— ZU 




















-10 










ggu 




gaa 


a ug 


/—i 

got 


LCl 


/T s~~s -f— 

get 




3 3 3 

dad 
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C*. u 


ct a 


tat 


aat 


caa 


96 


997 




Thr 


C 1 n 
\j± U 


L ic; L. 


Ma 


Qci >~ 
Oci 


Ala 


iild 


juys 




Ser 


Lys 


Leu 


Tvr 


Gly 


Gin 




9 Qft 

Z J o 
















1 
X 








5 












jUU 


tgt 


ggt 


ggt 


aag 




tgg 


aat 


/~r /~r 

ggc 






tat 


tac 


aaa 

i,j 


tct 


aaa 


tec 


144 




Cys 


Gly 


Gly 


Lys 




Trp 


Asn 






X 1 IX, 


Cvs 


Cvs 


Glu 


Ser 


Gly 


Ser 




^09 


10 










15 










20 










*25 






acc 


tgt 


aaa 


gta 




aac 


gat 


tac 


tac 


tct 


caa 


tat 


ctt 


acc 


cct 


gaa 


192 




Thr 


Cys 


Lys 


Val 


Qpr 


Asn 


Asp 


Tvr 

x y x. 


Tvr 


Ser 

kd/ 


Gin 


Cvs 


Leu 


Ala 


Pro 


Glu 














o u 




















40 






^Oft 


age 


aac 


ggc 


aat 


aag 


tct 


tct 


gaa 


tgt 


age 


aaa 


ttg 


tat 


aat 


caa 


tqt 


240 


~J \J Zf 


Ser 


Asn 


Gly 


Asn 


Lys 


Ser 


Ser 


Glu 


Cys 


Ser 


Lys 


Leu 


Tyr 


Gly 


Gin 


Cys 












45 










50 










55 








"31 2 


ggt 


ggt 


aag 


gac 


tgg 


aat 


ggc 


cct 


act 


tgt 


tgc 


gaa 


tct 


aaa 

Z? z3 ^ 


tec 


acc 


288 


31 3 

«J X. J 


Gly 


Gly 


Lys 


Asp 


Trp 


Asn 


Gly 


Pro 


Thr 


Cys 


Cys 


Glu 


Ser 


Gly 


Ser 


Thr 










60 










65 










70 










■J J. u 


tgt 


aaa 


gta 


age 


aac 


gat 


tac 


tac 


tct 


caa 


tqt 


ctt 


qcc 


cct 


gaa 


age 


336 


317 


Cys 


Lys 


Val 


Ser 


Asn 


Asp 


Tyr 


Tyr 


Ser 


Gin 


Cys 


Leu 


Ala 


Pro 


Glu 


Ser 




318 

<w> ^4. X*/ 




75 










80 










85 












^90 


aat 


ggc 


aat 


aaa 


act 


tct 


gaa 


age 


get 


cat 


U s^i. 


aca 


act 


act 


acc 


act 


384 


321 


Asn 


Gly 


Asn 


Lys 


Thr 


Ser 


Glu 


Ser 


Ala 


His 


Lys 


Thr 


Thr 


Thr 


Thr 


Thr 




322 


90 










95 










100 










105 




324 


get 


ccc 


get 


aag 


gaa 


att 


aca 


act 


act 


gee 


aaa 


get 


tea 


aac 


tct 


tct 


432 


325 


Ala 


Pro 


Ala 


Lys 


Glu 


He 


Thr 


Thr 


Thr 


Ala 


Lys 


Ala 


Ser 


Asn 


Ser 


Ser 




326 










110 










115 










120 






328 


aac 


tct 


age 


ggc 


aaa 


tac 


tec 


att 


gtc 


tct 


ggt 


ggt 


gee 


tct 


ggt 


aac 


480 


329 


Asn 


Ser 


Ser 


Gly 


Lys 


Tyr 


Ser 


He 


Val 


Ser 


Gly 


Gly 


Ala 


Ser 


Gly Asn 




330 








125 










130 










135 








332 


ggt 


gtc 


act 


act 


cgt 


tat 


tgg 


gat 


tgc 


tgt 


aag 


gee 


tec 


tgt 


age 


tgg 


528 


333 


Gly 


Val 


Thr 


Thr 


Arg 


Tyr 


Trp 


Asp 


Cys 


Cys 


Lys 


Ala 


Ser 


Cys 


Ser 


Trp 




334 






140 










145 










150 
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RAW SEQUENCE LISTING ERROR SUMMARY 
PATENT APPLICATION: US/09/807 , 933B 




DATE: 01/22/2003 
TIME: 13:30:50 



Input Set : A:\09807933SeqList.txt 
Output Set: N:\CRF4\01222003\l807933B.raw 



Please Note: 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 



Seq#: 


17; 


Xaa 


Pos 


. 1,2, 3, 4, 5, 6, 11, 12, 13, 14, 16, 17, 18,20,21,22,23,24,25,27, 28 


Seq#: 


17; 


Xaa 


Pos 


. 29,30,31, 33, 34, 36, 39 


Seq#: 


18; 


Xaa 


Pos 


. 1,2,3,4,5, 6, 11, 12, 13, 14, 16, 17, 18,20,21,22, 23, 24,25,27,28 


Seq#: 


18; 


Xaa 


Pos 


. 29, 30,31, 33, 34, 36, 39 


Seq# : 


19; 


Xaa 


Pos 


. 3, 4, 6, 11, 12, 14, 20, 21, 23, 26, 27, 28, 29, 30, 32, 33, 38 


Seq#: 


20; 


Xaa 


Pos 


. 3,4, 6, 11, 12, 14,20,21,23,26,27,28,29,30,32,33,38 


Seq#: 


21; 


Xaa 


Pos 


. 4, 6,12,14,26,27, 28,29, 30, 32, 33,38 


Seq#: 


25; 


Xaa 


Pos 


. 6, 14, 20, 21, 26, 29, 30, 38 


Seq#: 


29; 


Xaa 


Pos 


. 1,5,7,8,9,10,11,12 


Seq#: 


30; 


Xaa 


Pos 


. 2,3,4,8 


Seq# : 


31; 


Xaa 


Pos 


. 2,3,4,5,8,9 


Seq# : 


32; 


Xaa 


Pos 


. 2,3,4,5,8,9 


Seq#: 


43; 


N Pos. 


15,18 


Seq#: 


44; 


N Pos. 


15 
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